SemanticScuttle - klotz.me » Tags: rag+machine learning

Tags: rag* + machine learning*

0 bookmark(s) - Sort by: Date ↓ / Title /

A curated collection of Awesome LLM apps built with RAG, AI Agents, Multi-agent Teams, MCP, Voice Agents, and more. This repository features LLM apps that use models from OpenAI, Anthropic, Google, xAI and open-source models like Qwen or Llama.

2025-09-15 Tags: llm, rag, agents, open source, python, machine learning, github by klotz

Google DeepMind Finds a Fundamental Bug in RAG: Embedding Limits Break Retrieval at Scale

Google DeepMind research reveals a fundamental architectural limitation in Retrieval-Augmented Generation (RAG) systems related to fixed-size embeddings. The research demonstrates that retrieval performance degrades as database size increases, with theoretical limits based on embedding dimensionality. They introduce the LIMIT benchmark to empirically test these limitations and suggest alternatives like cross-encoders, multi-vector models, and sparse models.

2025-09-05 Tags: rag, retrieval-augmented generation, embeddings, google deepmind, limit benchmark, ai, machine learning, sparse models, cross-encoders, multi-vector models by klotz

Why Your RAG Embeddings Are Costing You a Fortune (And How I Fixed It)

This article details the often overlooked cost of storing embeddings for RAG systems, and how quantization techniques (int8 and binary) can significantly reduce storage requirements and improve retrieval speed without substantial accuracy loss.

2025-04-30 Tags: rag, embedding, vector database, transformers, llm, quantization by klotz

The power of the humble embedding

Ryan speaks with Edo Liberty, Founder and CEO of Pinecone, about building vector databases, the power of embeddings, the evolution of RAG, and fine-tuning AI models.

2025-04-02 Tags: pinecone, machine learning, embedding, vector databases, semantic search, rag by klotz

Retrieval Augmented Generation in SQLite

The article explores the concept of Retrieval-Augmented Generation (RAG) using SQLite, specifically with the sqlite-vec extension and the OpenAI API. It outlines a simplified approach to RAG, moving away from complex frameworks and cloud vector databases, using SQLite's virtual tables for vector search and semantic understanding.

2025-02-20 Tags: rag, llm sqlite, sqlite-vec, vector search, machine learning, data science by klotz

New Technique Makes RAG Systems Much Better at Retrieving the Right Documents

Researchers from Cornell University developed a technique called 'contextual document embeddings' to improve the performance of Retrieval-Augmented Generation (RAG) systems, enhancing the retrieval of relevant documents by making embedding models more context-aware.

Standard methods like bi-encoders often fail to account for context-specific details, leading to poor performance in application-specific datasets. Contextual document embeddings address this by enhancing the sensitivity of the embedding model to subtle differences in documents, particularly in specialized domains.

The researchers proposed two complementary methods to improve bi-encoders:

- Modifying the training process using contrastive learning to distinguish between similar documents.
- Modifying the bi-encoder architecture to incorporate corpus context during the embedding process.

These modifications allow the model to capture both the general context and specific details of documents, leading to better performance, especially in out-of-domain scenarios. The new technique has shown consistent improvements over standard bi-encoders and can be adapted for various applications beyond text-based models.

2024-10-10 Tags: rag, embedding, document retrieval, llm by klotz

Advanced RAG Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

2024-08-01 Tags: rag, nlp, machine learning, information retrieval, natural language processing, llm, embeddings, semantic search by klotz

A Step-by-Step Guide to Building and Distributing a Sleek RAG Pipeline

Walkthrough on building a Q and A pipeline using various tools, and distributing it with ModelKits for collaboration.

2024-07-10 Tags: llm, rag, kitops, python, machine learning, mlops, chromadb by klotz

The Challenges of Retrieving Relevant Context for RAG

Case study on measuring context relevance in retrieval-augmented generation systems using Ragas, TruLens, and DeepEval. Develop practical strategies to evaluate the accuracy and relevance of generated context.

2024-06-11 Tags: natural language processing, rag, machine learning, llm by klotz

Overcoming the Limits of RAG with ColBERT

ColBERT is a new way of scoring passage relevance using a BERT language model that substantially solves the problems with dense passage retrieval.

2024-03-12 Tags: llm, rag, embedding, bert, colbert, cosine distance, concept expansion by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: rag* + machine learning*

Linked Tags

Related Tags